92 research outputs found

    Malin: maximum likelihood analysis of intron evolution in eukaryotes

    Summary: Malin is a software package for the analysis of eukaryotic gene structure evolution. It provides a graphical user interface for various tasks commonly used to infer the evolution of exon–intron structure in protein-coding orthologs. Implemented tasks include the identification of conserved homologous intron sites in protein alignments, as well as the estimation of ancestral intron content and of lineage-specific intron losses and gains. Estimates are computed either by parsimony or with a probabilistic model that incorporates rate variation across lineages and intron sites.

    New method to determine FAO number of maize, Zea mays L.

    FAO numbers are generally calculated from the grain moisture at harvest, which has decreased substantially in recent decades. In many countries maize is now harvested at a grain moisture of around 20%. However, the lower the grain moisture at harvest, the smaller the difference in grain moisture between maturity groups and/or individual hybrids. The reliability of grain moisture measurements has not improved in step with the decline in the differences between hybrids, making it difficult to determine the maturity dates of the hybrids reliably. A new method has been developed to solve this problem and has been used successfully for the last two years in official trials in Hungary. The new method has several advantages: (a) more maturity parameters are taken into consideration, so the evaluation of more data improves reliability; (b) the regression between the maturity parameters and the FAO number is calculated using several standards, thus reducing the effect of the G x E interaction and the experimental error. As a result, the annual fluctuation in the FAO number for each 1% grain moisture is reduced.

    A Detailed History of Intron-rich Eukaryotic Ancestors Inferred from a Global Survey of 100 Complete Genomes

    Protein-coding genes in eukaryotes are interrupted by introns, but intron densities differ widely between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6–7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing the three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated, whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing.
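To make the Dollo parsimony comparison concrete, here is a minimal Python sketch of how Dollo parsimony assigns ancestral presence for a single intron site. It is not the reconstruction code used in the study. It assumes the standard Dollo rule (one gain, any number of losses); the tree, the taxon names, and the helper names `leaves` and `dollo_presence` are hypothetical.

```python
def leaves(tree):
    """Return the leaf names under a node; a leaf is a string, an internal node a tuple."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def dollo_presence(tree, present):
    """Clades (frozensets of leaf names) inferred to carry the intron under Dollo parsimony.

    The single gain is placed at the last common ancestor of all leaves that
    carry the intron; below that point, a node carries the intron only if at
    least one of its descendant leaves does (otherwise it has been lost).
    """
    present = frozenset(present)
    carriers = set()

    def visit(node, below_gain):
        clade = leaves(node)
        if not below_gain and present and present <= clade:
            # This node is the gain point if no single child holds every carrier.
            is_leaf = isinstance(node, str)
            if is_leaf or not any(present <= leaves(child) for child in node):
                below_gain = True
        if below_gain and clade & present:
            carriers.add(clade)
        if not isinstance(node, str):
            for child in node:
                visit(child, below_gain)

    visit(tree, False)
    return carriers


# Hypothetical five-taxon tree and a site observed in only two taxa.
tree = ((("human", "mouse"), "fly"), ("yeast", "plant"))
carriers = dollo_presence(tree, {"human", "fly"})
```

In this toy case the gain is placed at the ancestor of human, mouse and fly, the root is inferred to lack the intron, and mouse is scored as a loss. Because a site seen in only one clade is never assigned to deeper ancestors, a rule of this kind tends toward lower ancestral counts, which is consistent with the bias the abstract reports.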

    Streamlining and Large Ancestral Genomes in Archaea Inferred with a Phylogenetic Birth-and-Death Model

    Homologous genes originate from a common ancestor through vertical inheritance, duplication, or horizontal gene transfer. Entire homolog families spawned by a single ancestral gene can be identified across multiple genomes based on protein sequence similarity. The sequences, however, do not always reveal conclusively the history of large families. To study the evolution of complete gene repertoires, we propose here a mathematical framework that does not rely on resolved gene family histories. We show that so-called phylogenetic profiles, formed by family sizes across multiple genomes, are sufficient to infer principal evolutionary trends. The main novelty in our approach is an efficient algorithm to compute the likelihood of a phylogenetic profile in a model of birth-and-death processes acting on a phylogeny.
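As an illustration of the input described here, the following Python sketch builds phylogenetic profiles (family sizes per genome) from gene-to-family assignments. It shows only the data structure, not the likelihood algorithm from the paper; the function name `phylogenetic_profiles` and the genome and family labels are hypothetical.

```python
from collections import Counter


def phylogenetic_profiles(assignments, genomes):
    """Build one profile per family: the family's size in each genome, in the given order.

    assignments: list of (genome, family) pairs, one pair per gene.
    genomes: ordered list of genome names defining the profile columns.
    """
    counts = Counter(assignments)
    families = sorted({family for _, family in counts})
    # Counter returns 0 for (genome, family) pairs that never occur.
    return {fam: [counts[(g, fam)] for g in genomes] for fam in families}


# Hypothetical toy input: seven genes from three genomes in two families.
genes = [
    ("genomeA", "fam1"), ("genomeA", "fam1"), ("genomeA", "fam2"),
    ("genomeB", "fam1"),
    ("genomeC", "fam2"), ("genomeC", "fam2"), ("genomeC", "fam2"),
]
profiles = phylogenetic_profiles(genes, ["genomeA", "genomeB", "genomeC"])
```

Each profile is a vector of non-negative integers with one entry per genome, so it carries no information about which gene copy descends from which.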

    Nonsense-Mediated Decay Enables Intron Gain in Drosophila

    Intron number varies considerably among genomes, but despite their fundamental importance, the mutational mechanisms and evolutionary processes underlying the expansion of intron number remain unknown. Here we show that Drosophila, in contrast to most eukaryotic lineages, is still gaining introns at a dramatic rate. These novel introns carry significantly weaker splice sites that may impede their identification by the spliceosome. Novel introns are more likely to encode a premature termination codon (PTC), indicating that nonsense-mediated decay (NMD) functions as a backup for weak splicing of new introns. Our data suggest that new introns originate when genomic insertions with weak splice sites are hidden from selection by NMD. This mechanism reduces the sequence requirement imposed on novel introns and implies that the capacity of the spliceosome to recognize weak splice sites was a prerequisite for intron gain during eukaryotic evolution.

    Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences

    Profiling phylogenetic marker genes, such as the 16S rRNA gene, is a key tool for studies of microbial communities but does not provide direct evidence of a community’s functional capabilities. Here we describe PICRUSt (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States), a computational approach to predict the functional composition of a metagenome using marker gene data and a database of reference genomes. PICRUSt uses an extended ancestral-state reconstruction algorithm to predict which gene families are present and then combines gene families to estimate the composite metagenome. Using 16S information, PICRUSt recaptures key findings from the Human Microbiome Project and accurately predicts the abundance of gene families in host-associated and environmental communities, with quantifiable uncertainty. Our results demonstrate that phylogeny and function are sufficiently linked that this ‘predictive metagenomic’ approach should provide useful insights into the thousands of uncultivated microbial communities for which only marker gene surveys are currently available

    A poxvirus Bcl-2-like gene family involved in regulation of host immune response: sequence similarity and evolutionary history

    Background: Poxviruses evade the immune system of the host through the action of viral encoded inhibitors that block various signalling pathways. The exact number of viral inhibitors is not yet known. Several members of the vaccinia virus A46 and N1 families, with a Bcl-2-like structure, are involved in the regulation of the host innate immune response where they act non-redundantly at different levels of the Toll-like receptor signalling pathway. N1 also maintains an anti-apoptotic effect by acting similarly to cellular Bcl-2 proteins. Whether there are related families that could have similar functions is the main subject of this investigation. Results: We describe the sequence similarity existing among poxvirus A46, N1, N2 and C1 protein families, which share a common domain of approximately 110-140 amino acids at their C-termini that spans the entire N1 sequence. Secondary structure and fold recognition predictions suggest that this domain presents an all-alpha-helical fold compatible with the Bcl-2-like structures of vaccinia virus proteins N1, A52, B15 and K7. We propose that these protein families should be merged into a single one. We describe the phylogenetic distribution of this family and reconstruct its evolutionary history, which indicates an extensive gene gain in ancestral viruses and a further stabilization of its gene content. Conclusions: Based on the sequence/structure similarity, we propose that other members with unknown function, like vaccinia virus N2, C1, C6 and C16/B22, might have a similar role in the suppression of host immune response as A46, A52, B15 and K7, by antagonizing at different levels with the TLR signalling pathways.

    Genome of Leptomonas pyrrhocoris: a high-quality reference for monoxenous trypanosomatids and new insights into evolution of Leishmania

    Many high-quality genomes are available for dixenous (two-host) trypanosomatid species of the genera Trypanosoma, Leishmania, and Phytomonas, but only fragmentary information is available for monoxenous (single-host) trypanosomatids. In trypanosomatids, monoxeny is ancestral to dixeny, so the genome sequences of the key monoxenous parasites are expected to be instrumental for understanding both the origin of parasitism and the evolution of dixeny. Here, we present a high-quality genome for Leptomonas pyrrhocoris, which is closely related to the dixenous genus Leishmania. The L. pyrrhocoris genome (30.4 Mbp in 60 scaffolds) encodes 10,148 genes. Using the L. pyrrhocoris genome, we pinpointed genes gained in Leishmania. Among those genes, 20 genes with unknown function had expression patterns in the Leishmania mexicana life cycle that suggest their involvement in virulence. By combining differential expression data for L. mexicana, L. major and Leptomonas seymouri, we have identified several additional proteins potentially involved in virulence, including SpoU methylase and U3 small nucleolar ribonucleoprotein IMP3. The population genetics of L. pyrrhocoris was also addressed by sequencing thirteen strains of different geographic origin, allowing the identification of 1,318 genes under positive selection. This set of genes was significantly enriched in components of the cytoskeleton and the flagellum.

    Mineralogical attenuation for metallic remediation in a passive system for mine water treatment

    Passive systems with constructed wetlands have been consistently used to treat mine water from abandoned mines. Long-term and cost-effective remediation is a crucial expectation for these water treatment facilities. To achieve that, a complex chain of physical, chemical, biological, and mineralogical mechanisms for pollutant removal must be designed to simulate natural attenuation processes. This paper presents geochemical and mineralogical data obtained in a recently constructed passive system (from an abandoned mine, Jales, Northern Portugal). It shows the role of different solid materials in the retention of metals and arsenic, observed during the start-up period of the treatment plant. The mineralogical study focused on two types of materials: (1) the ochre-precipitates, formed as waste products from the neutralization process, and (2) the fine-grained minerals contained in the soil of the wetlands. The ochre-precipitates proved to be poorly ordered iron-rich material, which gave rise to hematite upon artificial heating. The heating experiments also provided mineralogical evidence for the presence of an associated amorphous arsenic-rich compound. Chemical analysis of the fresh ochre-precipitates revealed high concentrations of arsenic (51,867 ppm) and metals, such as zinc (1,213 ppm) and manganese (821 ppm), indicating strong enrichment factors relative to the water from which they precipitate. Mineralogical data obtained in the soil of the wetlands indicate that chlorite, illite, chlorite–vermiculite and mica–vermiculite mixed layers, vermiculite, kaolinite and goethite are concentrated in the fine-grained fractions (<20 and <2 μm). The chemical analyses show that high levels of arsenic (up to 3%) and metals are also retained in these fractions, which may be enhanced by the low degree of order of the clay minerals as suggested by an XRD study. The results suggest that, although the treatment plant has been receiving water only since 2006, future performance will be strongly dependent on these identified mineralogical pollutant hosts. Fundação para a Ciência e a Tecnologia (FCT)

    Algebraic Distribution of Segmental Duplication Lengths in Whole-Genome Sequence Self-Alignments

    Distributions of duplicated sequences from genome self-alignment are characterized, including forward and backward alignments in bacteria and eukaryotes. A Markovian process without auto-correlation, as expected from the local effects of point mutation and selection on localised function, should generate an exponential distribution; however, the observed distributions deviate substantially from exponential form and are instead roughly algebraic, suggesting a novel kind of long-distance correlation that must be non-local in origin.
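The contrast between an exponential and an algebraic (power-law) length distribution can be sketched in Python by fitting both forms by maximum likelihood and comparing their log-likelihoods. This is only an illustration under stated assumptions, not the analysis from the paper: the lengths are synthetic draws from a Pareto distribution, the cutoff `xmin` is an arbitrary choice, and the function names are hypothetical.

```python
import math
import random


def exponential_loglik(xs, xmin):
    """Log-likelihood of a shifted exponential, p(x) = r * exp(-r * (x - xmin)), at its MLE."""
    shifted = [x - xmin for x in xs]
    rate = len(shifted) / sum(shifted)  # maximum-likelihood rate
    return sum(math.log(rate) - rate * s for s in shifted)


def power_law_loglik(xs, xmin):
    """Log-likelihood of a power law, p(x) = ((a - 1) / xmin) * (x / xmin) ** -a, at its MLE."""
    logs = [math.log(x / xmin) for x in xs]
    alpha = 1 + len(logs) / sum(logs)  # maximum-likelihood exponent
    return sum(math.log((alpha - 1) / xmin) - alpha * l for l in logs)


# Synthetic, heavy-tailed "duplication lengths" (assumption: not real genome data).
random.seed(0)
xmin = 100.0
lengths = [xmin * random.paretovariate(1.5) for _ in range(5000)]

ll_exp = exponential_loglik(lengths, xmin)
ll_pow = power_law_loglik(lengths, xmin)
```

For heavy-tailed data such as this synthetic sample, the power-law fit attains the higher log-likelihood. On a log-log plot the same contrast appears as a roughly straight line for algebraic decay, where an exponential curves sharply downward.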